Novel Phylogenetic Network Inference by Combining Maximum Likelihood and Hidden Markov Models

نویسندگان

  • Sagi Snir
  • Tamir Tuller
چکیده

Horizontal Gene Transfer (HGT) is the event of transferring genetic material from one lineage in the evolutionary tree to a different lineage. HGTplays amajor role in bacterial genome diversification and is a significant mechanism by which bacteria develop resistance to antibiotics. Although the prevailing assumption is of complete HGT, cases of partial HGT (which are also named chimeric HGT) where only part of a gene is horizontally transferred, have also been reported, albeit less frequently. In this work we suggest a new probabilistic model for analyzing and modeling phylogenetic networks, the NET-HMM. This new model captures the biologically realistic assumption that neighboring sites of DNA or amino acid sequences are not independent, which increases the accuracy of the inference. The model describes the phylogenetic network as a Hidden Markov Model (HMM), where each hidden state is related to one of the network’s trees. One of the advantages of the NET-HMM is its ability to infer partial HGT as well as complete HGT. We describe the properties of the NET-HMM, devise efficient algorithms for solving a set of problems related to it, and implement them in software. We also provide a novel complementary significance test for evaluating the fitness of a model (NET-HMM) to a given data set. Using NET-HMM we are able to answer interesting biological questions, such as inferring the length of partial HGT’s and the affected nucleotides in the genomic sequences, as well as inferring the exact location of HGT events along the tree branches. These advantages are demonstrated through the analysis of synthetical inputs and two different biological inputs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Net-HMM Approach: Phylogenetic Network Inference by Combining Maximum Likelihood and Hidden Markov Models

Horizontal gene transfer (HGT) is the event of transferring genetic material from one lineage in the evolutionary tree to a different lineage. HGT plays a major role in bacterial genome diversification and is a significant mechanism by which bacteria develop resistance to antibiotics. Although the prevailing assumption is of complete HGT, cases of partial HGT (which are also named chimeric HGT)...

متن کامل

Probability , Statistics , and Computational Science Niko

In this chapter, we review basic concepts from probability theory and computational statistics that are fundamental to evolutionary genomics. We provide a very basic introduction to statistical modeling and discuss general principles, including maximum likelihood and Bayesian inference. Markov chains, hidden Markov models, and Bayesian network models are introduced in more detail as they occur ...

متن کامل

Computation of Restricted Maximum-penalized-likelihood Estimates in Hidden Markov Models

The maximum-penalized-likelihood estimation for hidden Markov models with general observation densities is described. All statistical inference, including the model estimation, testing, and selection, is based on the restricted optimization of the penalized likelihood function with respect to the chosen model family. The method is used in an economic application, where stock market index return...

متن کامل

Inference in Hidden Markov Models I: Local Asymptotic Normality in the Stationary Case

Following up on Baum and Petrie (1966) we study likelihood based methods in hidden Markov models, where the hiding mechanism can lead to continuous observations and is itself governed by a parametric model. We show that procedures essentially equivalent to maximum likelihood estimates are asymptotically normal as expected and consistent estimates of their variance can be constructed, so that th...

متن کامل

Coupling stochastic EM and approximate Bayesian computation for parameter inference in state-space models

We study the class of state-space models (or hidden Markov models) and perform maximum likelihood inference on the model parameters. We consider a stochastic approximation expectation-maximization (SAEM) algorithm to maximize the likelihood function with the novelty of using approximate Bayesian computation (ABC) within SAEM. The task is to provide each iteration of SAEM with a filtered state o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008